Abstract. The Wong-Zakai theorem asserts that ODEs driven by "reasonable"
(e.g. piecewise linear) approximations of Brownian motion converge to
the corresponding Stratonovich stochastic differential equation. With the aid
of rough path analysis, we study "non-reasonable" approximations and go beyond
a well-known criterion of [Ikeda-Watanabe, North Holland 1989] in the
sense that our result applies to perturbations on all levels, exhibiting additional
drift terms involving any iterated Lie brackets of the driving vector
fields. In particular, this applies to the approximations by McShane ('72) and
Sussmann ('91). Our approach is not restricted to Brownian driving signals.
Finally, these ideas can be used to prove optimality of certain rough path
estimates.



1. Preliminaries 

1.1. Rough differential equations. Let $\alpha \in (0,1]$. A weak geometric $\alpha$-Hölder rough path $\mathbf{x}$ over $\mathbb{R}^d$ is a continuous path on $[0,T]$ with values in $G^{[1/\alpha]}(\mathbb{R}^d)$, the step-$[1/\alpha]$ nilpotent group over $\mathbb{R}^d$, of finite $\alpha$-Hölder regularity relative to $d$, the Carnot-Carathéodory metric on $G^{[1/\alpha]}(\mathbb{R}^d)$, i.e.

$$\|\mathbf{x}\|_{\alpha\text{-Höl};[0,T]} = \sup_{0 \le s < t \le T} \frac{d(\mathbf{x}_s, \mathbf{x}_t)}{|t-s|^{\alpha}} < \infty.$$



For orientation, let us discuss the case $\alpha \in (1/3,1/2)$, which covers Brownian motion (for details see [Lyo98], [LQ02], [LCL07], [FV08b]). We realize $G^2(\mathbb{R}^d)$ as the set of all $(a,b) \in \mathbb{R}^d \oplus \mathbb{R}^{d \times d}$ for which $\mathrm{Sym}(b) = a^{\otimes 2}/2$. (This point of view is natural: a smooth $\mathbb{R}^d$-valued path $x = (x^i_t)_{i=1,\dots,d}$ enhanced with its iterated integrals $\int_0^t \int_0^s dx^i_u \, dx^j_s$ gives canonically rise to a $G^2(\mathbb{R}^d)$-valued path.) Given $(a,b) \in G^2(\mathbb{R}^d)$ one gets rid of the redundant $\mathrm{Sym}(b)$ by $(a,b) \mapsto (a, b - a^{\otimes 2}/2) \in \mathbb{R}^d \oplus so(d)$. Applied to $x$ enhanced with its iterated integrals over $[0,t]$ this amounts to looking at the path $x$ and its (signed) areas $\int_0^t x^i_{0,s}\,dx^j_s - \int_0^t x^j_{0,s}\,dx^i_s$, $i,j \in \{1,\dots,d\}$. Without going into too much detail, the group structure on $G^2(\mathbb{R}^d)$ can be identified with the (truncated) tensor multiplication and is relevant as it allows one to relate algebraically the path and area increments over adjacent intervals; the mapping $(a,b) \mapsto (a, b - a^{\otimes 2}/2)$ maps the Lie group $G^2(\mathbb{R}^d)$ to its Lie algebra; finally, the Carnot-Carathéodory metric is defined intrinsically as a (left-)invariant metric on $G^2(\mathbb{R}^d)$ and satisfies $|a| + |b|^{1/2} \lesssim d((0,0),(a,b)) \lesssim |a| + |b|^{1/2}$.

1 $[\cdot]$ gives the integer part of a real number.
2 Given an interval $I = [a,b]$, for brevity we write $x_I = x_{a,b} = x_b - x_a$.
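
As a small numerical companion to the description of $G^2(\mathbb{R}^d)$ above, the following sketch lifts a piecewise linear path in $\mathbb{R}^2$ to (increment, area) coordinates by multiplying the group elements of its straight segments. It is an illustration only: the function names are hypothetical, the area is normalized with a factor 1/2 (so that a loop's area equals the enclosed area, a convention chosen here), and `homogeneous_norm` is merely the quantity $|a| + |b|^{1/2}$ that is comparable to the Carnot-Carathéodory norm.

```python
import math

def mul(g, h):
    """Product of two step-2 group elements over R^2 in (dx, dy, area) coordinates.

    Assumed convention: area = (1/2) * (int x dy - int y dx), so the cross term
    is half the determinant of the two increments.
    """
    (a1, a2, A), (b1, b2, B) = g, h
    return (a1 + b1, a2 + b2, A + B + 0.5 * (a1 * b2 - a2 * b1))

def lift_piecewise_linear(increments):
    """Lift a piecewise linear path, given by its segment increments."""
    g = (0.0, 0.0, 0.0)
    for dx, dy in increments:
        g = mul(g, (dx, dy, 0.0))  # a straight segment carries no area
    return g

def homogeneous_norm(g):
    """|a| + |area|^(1/2), comparable to the Carnot-Caratheodory norm."""
    a1, a2, A = g
    return math.hypot(a1, a2) + math.sqrt(abs(A))

# Counter-clockwise unit square: zero net increment, enclosed area 1.
square = [(1.0, 0.0), (0.0, 1.0), (-1.0, 0.0), (0.0, -1.0)]
loop = lift_piecewise_linear(square)
```

Scaling every increment by a factor $\lambda$ multiplies the area by $\lambda^2$ and the homogeneous norm by $\lambda$, which is the dilation behaviour used repeatedly in the examples of section 2.2.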




Non-standard approximations of the Ito-map 



One can then think of a geometric $\alpha$-Hölder rough path $\mathbf{x}$ as a path $x : [0,T] \to \mathbb{R}^d$ enhanced with its iterated integrals (equivalently: area integrals) although the latter need not make classical sense. For instance, almost every joint realization of Brownian motion and Lévy's area process is a geometric $\alpha$-Hölder rough path. Lyons' theory of rough paths then gives deterministic meaning to the rough differential equation (RDE)

$$dy = V(y)\,d\mathbf{x}, \quad y(0) = y_0$$

for Lip$^\gamma$-vector fields (in the sense of Stein), $\gamma > 1/\alpha \ge 1$, and we write $y_t = \pi_{(V)}(0, y_0; \mathbf{x})_t$ for this solution. By considering the space-time rough path $\tilde{\mathbf{x}} = (t, \mathbf{x})$ and $\tilde V = (V_0, V_1, \dots, V_d)$ one can consider RDEs with drift. Although well studied [LV06], with a view towards minimal regularity assumptions on $V_0$, we shall need certain "Euler" estimates [FV08a] for RDEs with drift which are not available in the current literature. The "Doss-Sussmann method" (implemented for RDEs in section 3) will provide a quick route to these estimates.

1.2. Standard and non-standard approximations. Assume we are given a weak geometric $\alpha$-Hölder rough path $\mathbf{x}$ and a path $p$ that takes values in the center of $G^N(\mathbb{R}^d)$ for some integer $N \ge [1/\alpha]$ (think of the path $p$ as a perturbation of our original path $\mathbf{x}$). Further assume that $p$ is $\beta$-Hölder continuous. It is well known [Lyo98] that $\mathbf{x}$ can be lifted uniquely to an $\alpha$-Hölder path $S_N(\mathbf{x})$ with values in $G^N(\mathbb{R}^d)$. Then,

$$S_N(\mathbf{x})_{0,\cdot} \otimes p \in C^{\min(\alpha,\beta)\text{-Höl}}([0,T], G^N(\mathbb{R}^d)).$$

From general facts about such spaces (e.g. [FV06b], relying only on the fact that $G^N(\mathbb{R}^d)$ is a geodesic space) we can find a sequence of Lipschitz continuous paths, $x^n : [0,T] \to \mathbb{R}^d$, so that

$$d_\infty(S_N(x^n), S_N(\mathbf{x})_{0,\cdot} \otimes p) \to 0 \text{ as } n \to \infty \quad \text{and} \quad \sup_n \|S_N(x^n)\|_{\min(\alpha,\beta)\text{-Höl}} < \infty.$$

(The approximations $x^n$ are constructed based on geodesics associated to the $G^N(\mathbb{R}^d)$-valued increments of $S_N(\mathbf{x}) \otimes p$.) By interpolation, it then follows that for all $\gamma < \min(\alpha,\beta)$,

$$d_{\gamma\text{-Höl}}(S_N(x^n), S_N(\mathbf{x}) \otimes p) \to 0 \text{ as } n \to \infty.$$
The interest of such a construction is that the limiting behaviour of ODEs driven 
by the x n , provided [I/7] > N and T > 1/7, exhibits additional drift behaviour in 
terms of the Lie brackets of the driving vector fields and the perturbation p (cf. 
sections 13.11 and [3]) . 

We may apply this to Brownian motion and Lévy area, i.e. $\mathbf{x} = \mathbf{B}(\omega) = \exp(B + A)$, in which case the approximations $x^n$ are constructed in a purely deterministic fashion based on the realization of the Brownian path and its Lévy area. Interesting as it may be, this is not fully satisfactory as it stands in contrast to (probabilistic) non-standard approximation results (McShane [McS72], Sussmann [Sus91], ...) which have the desirable property that the approximations depend only on (finitely many points of) the Brownian path. The first aim of this paper is to give a criterion that covers all these examples in a flexible framework of random rough paths.

3 i.e. a function is Lip$^\gamma$ if it is $\lfloor\gamma\rfloor$-times ($\lfloor\cdot\rfloor = [\cdot] - 1$ on integers, otherwise equal) differentiable, the $\lfloor\gamma\rfloor$th derivative is $(\gamma - \lfloor\gamma\rfloor)$-Hölder continuous and the function and all its derivatives are bounded.

En passant, this allows for a painless extension to various non-Brownian driving signals (cf. remark 1). We then give a rigorous analysis of how to translate RDEs driven by $S_N(\mathbf{x}) \otimes p$ to RDEs driven by $\mathbf{x}$, in which we see the appearance of additional drift vector fields, obtained as contraction of iterated Lie brackets of $V = (V_1, \dots, V_d)$ and the components of $p$. (At this stage, we need a good quantitative understanding of RDEs with drift.) Finally, as a spin-off of these ideas, we show in section 4.2 optimality of certain rough path estimates, answering a question that was left open in [CFV08].

2. (Non-Standard) Approximations 

2.1. Criterion for convergence to a "non-standard" limit. 

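Theorem 1. Let $\alpha, \beta \in (0,1]$ and $N \ge [1/\gamma]$ for $\gamma := \min(\alpha,\beta)$. Also, let $\mathbf{x} \in C^{\alpha\text{-Höl}}([0,T], G^{[1/\alpha]}(\mathbb{R}^d))$; write $x = \pi_1(\mathbf{x})$ for its projection to a path with values in $\mathbb{R}^d$, and assume that there exists a sequence of dissections $(D_n) = (t^n_i : i)$ of $[0,T]$ such that

$$\sup_{n \in \mathbb{N}} \|S_{[1/\alpha]}(x^{D_n})\|_{\alpha\text{-Höl}} < \infty \quad \text{and} \quad d_\infty(S_{[1/\alpha]}(x^{D_n}), \mathbf{x}) \to_{n\to\infty} 0,$$

where we write $x^{D_n}$ for the usual piecewise linear approximation to $x$ based on $D_n$. Let $(x^n) \subset C^{1\text{-Höl}}([0,T], \mathbb{R}^d)$ be such that, for all $t \in D_n$,

$$p^n_t := S_N(x^n)_{0,t} \otimes S_N(x^{D_n})^{-1}_{0,t}$$

takes values in the center of $G^N(\mathbb{R}^d)$.

(i) If there exist $c_1, c_2, c_3 \in [1,\infty)$ such that for all $t^n_i, t^n_{i+1} \in D_n$

$$|x^n|_{1\text{-Höl};[t^n_i,t^n_{i+1}]} \le c_1 |x^{D_n}|_{1\text{-Höl};[t^n_i,t^n_{i+1}]} + c_2 |t^n_{i+1} - t^n_i|^{\beta-1} \quad \text{and}$$

$$\|p^n_{s,t}\| \le c_3 |t-s|^{\beta} \quad \text{for all } s,t \in D_n,$$

then there exists a $C = C(\alpha, \beta, c_1\|\mathbf{x}\|_{\alpha\text{-Höl}}, c_2, T, N)$ such that

$$\sup_{n\in\mathbb{N}} \|S_N(x^n)\|_{\gamma\text{-Höl}} \le C \Big( \sup_{n\in\mathbb{N}} \|S_{[1/\alpha]}(x^{D_n})\|_{\alpha\text{-Höl}} + c_3 + 1 \Big) < \infty.$$

(ii) If $p^n_t \to p_t$ for all $t \in \cup_n D_n$ and $\cup_n D_n$ is dense in $[0,T]$ then $p$ extends to a continuous (in fact, $\beta$-Hölder continuous) path with values in the center of $G^N(\mathbb{R}^d)$ and for all $t \in [0,T]$,

$$d(S_N(x^n)_{0,t}, S_N(\mathbf{x})_{0,t} \otimes p_{0,t}) \le d(S_N(x^{D_n})_{0,t}, S_N(\mathbf{x})_{0,t}) + d(p^n_{0,t}, p_{0,t})$$

and converges to $0$ as $n \to \infty$.

In particular, if the assumptions of both (i) and (ii) are met then

$$d_\infty(S_N(x^n), S_N(\mathbf{x}) \otimes p) \to_{n\to\infty} 0, \qquad \sup_n \|S_N(x^n)\|_{\gamma\text{-Höl}} < \infty,$$

and by interpolation, for all $\gamma' < \gamma$,

$$d_{\gamma'\text{-Höl}}(S_N(x^n), S_N(\mathbf{x}) \otimes p) \to_{n\to\infty} 0.$$

Proof. (i) Take $s < t$ in $[0,T]$. If $s,t \in [t^n_i, t^n_{i+1}]$ we have, by our assumption on $|x^n|_{1\text{-Höl};[t^n_i,t^n_{i+1}]}$,

$$\begin{aligned}
\|S_N(x^n)_{s,t}\| &\le |t-s| \, \|S_N(x^n)\|_{1\text{-Höl};[t^n_i,t^n_{i+1}]} \\
&= |t-s| \, |x^n|_{1\text{-Höl};[t^n_i,t^n_{i+1}]} \\
&\le |t-s| \big( c_1 |x^{D_n}|_{1\text{-Höl};[t^n_i,t^n_{i+1}]} + c_2 |t^n_{i+1}-t^n_i|^{\beta-1} \big) \\
&\le |t-s| \big( c_1 \|\mathbf{x}\|_{\alpha\text{-Höl}} |t^n_{i+1}-t^n_i|^{\alpha-1} + c_2 |t^n_{i+1}-t^n_i|^{\beta-1} \big) \\
&\le c_0 |t-s|^{\gamma}
\end{aligned}$$

with $c_0 = c_0(\alpha, \beta, c_1\|\mathbf{x}\|_{\alpha\text{-Höl}}, c_2, T)$. Otherwise we can find $t^n_i \le t^n_j$ so that $s \le t^n_i \le t^n_j \le t$ and

$$\|S_N(x^n)_{s,t}\| \le 2 c_0 |t-s|^{\gamma} + \|S_N(x^n)_{t^n_i,t^n_j}\|.$$

Using estimates for the Lyons-lift $\mathbf{x} \mapsto S_N(\mathbf{x})$, [Lyo98, theorem 2.2.1], we can further estimate

$$\begin{aligned}
\|S_N(x^n)_{t^n_i,t^n_j}\| &\le \|S_N(x^{D_n})_{t^n_i,t^n_j}\| + \|p^n_{t^n_i,t^n_j}\| \\
&\le c_{N,\alpha} \|S_{[1/\alpha]}(x^{D_n})\|_{\alpha\text{-Höl}} |t^n_j - t^n_i|^{\alpha} + c_3 |t^n_j - t^n_i|^{\beta} \\
&\le \big( c_{N,\alpha} \|S_{[1/\alpha]}(x^{D_n})\|_{\alpha\text{-Höl}} + c_3 \big) |t-s|^{\gamma}
\end{aligned}$$

and, since $\sup_n \|S_{[1/\alpha]}(x^{D_n})\|_{\alpha\text{-Höl}} < \infty$ by assumption, the proof of the uniform Hölder bound is finished.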
(ii) By assumption, $p^n$ is uniformly $\beta$-Hölder. By a standard Arzelà-Ascoli type argument, it is clear that every pointwise limit (if only on the dense set $\cup_n D_n$) is a uniform limit. $\beta$-Hölder regularity is preserved in this limit and so $p$ is $\beta$-Hölder itself. Since $p^n_t$ takes values in the step-$N$ center for all $t \in D_n$ and $\cup_n D_n$ is dense, it is easy to see that $p$ takes values in the step-$N$ center for all $t \in [0,T]$. Now take $t \in D_n$. Since elements in the center commute with all elements in $G^N(\mathbb{R}^d)$ we have

$$\begin{aligned}
d(S_N(x^n)_{0,t}, S_N(\mathbf{x})_{0,t} \otimes p_{0,t})
&= \big\| S_N(x^n)^{-1}_{0,t} \otimes S_N(x^{D_n})_{0,t} \otimes p^n_{0,t} \otimes S_N(x^{D_n})^{-1}_{0,t} \otimes S_N(\mathbf{x})_{0,t} \otimes (p^n_{0,t})^{-1} \otimes p_{0,t} \big\| \\
&= \big\| S_N(x^{D_n})^{-1}_{0,t} \otimes S_N(\mathbf{x})_{0,t} \otimes (p^n_{0,t})^{-1} \otimes p_{0,t} \big\| \\
&\le d(S_N(x^{D_n})_{0,t}, S_N(\mathbf{x})_{0,t}) + d(p^n_{0,t}, p_{0,t}).
\end{aligned}$$

On the other hand, given an arbitrary element $t \in [0,T]$ we can take $t^n$ to be the closest neighbour in $D_n$ and so

$$\begin{aligned}
d(S_N(x^n)_{0,t}, S_N(\mathbf{x})_{0,t} \otimes p_{0,t})
&\le d(S_N(x^{D_n})_{0,t^n}, S_N(\mathbf{x})_{0,t^n}) + 2\, d(p^n_{0,t^n}, p_{0,t}) \\
&\quad + d\big( S_N(\mathbf{x})^{-1}_{0,t} \otimes S_N(x^n)_{0,t}, \; S_N(\mathbf{x})^{-1}_{0,t^n} \otimes S_N(x^n)_{0,t^n} \big).
\end{aligned}$$

From the assumptions and Hölder (resp. uniform Hölder) continuity of $S_N(\mathbf{x})$ (resp. $S_N(x^n)$) we see that $d(S_N(x^n)_{0,t}, S_N(\mathbf{x})_{0,t} \otimes p_{0,t}) \to 0$, as required. □

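Corollary 1. Let $\alpha, \beta \in (0,1]$, $N \ge [1/\min(\alpha,\beta)]$. Also, let $\mathbf{X}(\omega) \in C^{\alpha\text{-Höl}}([0,T], G^{[1/\alpha]}(\mathbb{R}^d))$; write $X = \pi_1(\mathbf{X})$ for its projection to a process with values in $\mathbb{R}^d$, and assume that

$$\forall q \in [1,\infty): \quad \sup_{n\in\mathbb{N}} \Big| \|S_{[1/\alpha]}(X^{D_n})\|_{\alpha\text{-Höl}} \Big|_{L^q(\mathbb{P})} < \infty,$$

$$d_\infty(S_{[1/\alpha]}(X^{D_n}), \mathbf{X}) \to 0 \text{ in probability as } n \to \infty.$$

Let $(X^n(\omega))_{n\in\mathbb{N}} \subset C^{1\text{-Höl}}([0,T], \mathbb{R}^d)$ be such that, for all $\omega$ and all $t \in D_n$,

$$P^n_t(\omega) := S_N(X^n)_{0,t} \otimes S_N(X^{D_n})^{-1}_{0,t}$$

takes values in the center of $G^N(\mathbb{R}^d)$.

(i) If there exist $c_1, c_2, c_3 = c_3(q) \in [1,\infty)$ such that for all $t^n_i, t^n_{i+1} \in D_n$, all $\omega$ and all $q \in [1,\infty)$,

$$|X^n|_{1\text{-Höl};[t^n_i,t^n_{i+1}]} \le c_1 |X^{D_n}|_{1\text{-Höl};[t^n_i,t^n_{i+1}]} + c_2 |t^n_{i+1} - t^n_i|^{\beta-1},$$

$$\Big| \|P^n_{s,t}\| \Big|_{L^q(\mathbb{P})} \le c_3 |t-s|^{\beta} \quad \text{for all } s,t \in [0,T],$$

then, for all $\gamma < \min(\alpha,\beta)$,

$$\forall q: \quad \sup_n \Big| \|S_N(X^n)\|_{\gamma\text{-Höl}} \Big|_{L^q(\mathbb{P})} < \infty.$$

(ii) If $P^n_t \to P_t$ in probability for all $t \in \cup_{n\in\mathbb{N}} D_n$ and $\cup_{n\in\mathbb{N}} D_n$ is dense in $[0,T]$ then

$$d(S_N(X^n)_{0,t}, S_N(\mathbf{X})_{0,t} \otimes P_{0,t}) \to 0 \text{ in probability.}$$

In particular, if the assumptions of both (i) and (ii) are met then, for all $\gamma < \min(\alpha,\beta)$,

$$d_{\gamma\text{-Höl}}(S_N(X^n), S_N(\mathbf{X}) \otimes P) \to 0 \text{ in } L^q \text{ for all } q \in [1,\infty).$$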



Proof. (i) By a standard Garsia-Rodemich-Rumsey or Kolmogorov argument, the assumption on $| \|P^n_{s,t}\| |_{L^q(\mathbb{P})}$ implies, for any $\tilde\beta < \beta$, the existence of $C_3 \in L^q$ for all $q \in [1,\infty)$ so that

$$\forall s < t \text{ in } [0,T]: \quad \|P^n_{s,t}\| \le C_3(\omega) |t-s|^{\tilde\beta}.$$

We apply theorem 1 with $\tilde\beta$ instead of $\beta$ and learn that there exists a deterministic constant $C$ such that

$$\sup_n \|S_N(X^n)\|_{\min(\alpha,\tilde\beta)\text{-Höl}} \le C \Big( \sup_n \|S_{[1/\alpha]}(X^{D_n})\|_{\alpha\text{-Höl}} + 1 + C_3 \Big).$$

Taking $L^q$-norms finishes the uniform $L^q$-bound. (We take $\tilde\beta$ large enough so that $\min(\alpha,\tilde\beta) > \gamma$.)

(ii) From theorem 1,

$$d(S_N(X^n)_{0,t}, S_N(\mathbf{X})_{0,t} \otimes P_{0,t}) \le d(S_N(X^{D_n})_{0,t}, S_N(\mathbf{X})_{0,t}) + d(P^n_{0,t}, P_{0,t})$$

which, from the assumptions, obviously converges to $0$ (in probability) for every fixed $t \in \cup_n D_n$. From general facts on $L^q$-convergence of rough paths (cf. [FV07, Appendix]) this implies the claimed convergence. (Inspection of the proof shows that convergence in probability for all $t$ in a dense set of $[0,T]$ is enough.) □

Remark 1. Although at first sight technical, our assumptions are fairly natural: firstly, we restrict our attention to (random) Hölder rough paths $\mathbf{X}$ which are the limit of "their (lifted) piecewise linear approximations". This covers the bulk of stochastic processes which admit a lift to a rough path, including semi-martingales [CL05, FV06a], fractional Brownian motion with $H > 1/4$ and many other Gaussian processes [CQ02, FV07], as well as Markov processes with uniformly elliptic generator in divergence form [Str88, Lej02, FV06c].

Secondly, the assumptions on $X^n$ and $P^n$ guarantee that $X^n$ remains, at $\min(\alpha,\beta)$-Hölder scale, comparable to the piecewise linear approximations. In particular, the assumption on $|X^n|_{1\text{-Höl};[t^n_i,t^n_{i+1}]}$ is easy to verify in all examples below. The intuition is that, if we assume that $X^n$ runs at constant speed over any interval $I = [t^n_i, t^n_{i+1}]$, $D_n = (t^n_i)$, it is equivalent to saying that

$$\mathrm{length}(X^n|_I) \le c_1 \, \mathrm{length}(X^{D_n}|_I) + c_2 |I|^{\beta} \quad \big( = c_1 |X_{t^n_i,t^n_{i+1}}| + c_2 |t^n_{i+1} - t^n_i|^{\beta} \big).$$

2.2. Examples. 

2.2.1. Sussmann [Sus91]. Take any sequence of dissections of $[0,T]$, say $(D_n)$, with mesh $|D_n| \to 0$ and think of $\mathbf{X}(\omega)$ as Brownian motion plus Lévy's area so that $\pi_1(\mathbf{X}) = X$ is $d$-dimensional Brownian motion. The piecewise linear approximation $X^{D_n}$ is nothing but the repeated concatenation of linear chords connecting the points $(X_t : t \in D_n)$. For some fixed $v \in g^N(\mathbb{R}^d) \cap (\mathbb{R}^d)^{\otimes N}$, $N \in \{2, 3, \dots\}$, we now construct Sussmann's non-standard approximation $X^n$ as (repeated) concatenation of linear chords and "geodesic loops". First, we require $X^n(t) = X(t)$ for all $t \in D_n = (t^n_i : i)$. For intermediate times, i.e. $t \in (t^n_{i-1}, t^n_i)$ for some $i$, we proceed as follows: for $t \in [t^n_{i-1}, (t^n_{i-1} + t^n_i)/2]$ we run linearly and at constant speed from $X(t^n_{i-1})$ such as to reach $X(t^n_i)$ by time $(t^n_{i-1} + t^n_i)/2$. (This is the usual linear interpolation between $X(t^n_{i-1})$ and $X(t^n_i)$ but run at double speed.) This leaves us with the interval $[(t^n_{i-1} + t^n_i)/2, t^n_i]$ for other purposes and we run, starting at $X(t^n_i) \in \mathbb{R}^d$, through a "geodesic" $\xi : [(t^n_{i-1} + t^n_i)/2, t^n_i] \to \mathbb{R}^d$ associated to $\exp(v\,|t^n_i - t^n_{i-1}|) \in G^N(\mathbb{R}^d)$. Since $N > 1$, $\pi_1(\exp(v\,|t^n_i - t^n_{i-1}|)) = 0$ and so this geodesic path returns to its starting point in $\mathbb{R}^d$; in particular

$$X^n((t^n_{i-1} + t^n_i)/2) = X(t^n_i).$$

It is easy to see (via Chen's theorem) that this approximation satisfies the assumptions of corollary 1 with

$$P^n_{s,t} := S_N(X^n)_{s,t} \otimes S_N(X^{D_n})^{-1}_{s,t} = e^{v(t-s)} \quad \forall s,t \in D_n,$$

(so that $| \|P^n_{s,t}\| |_{L^q} = \|P^n_{s,t}\| \lesssim |t-s|^{1/N}$ first for all $s,t \in D_n$ and then, as is easy to see, for all $s,t$) and deterministic limit $P_{0,t} = e^{vt}$, $\beta = 1/N$. Indeed, the length of $X^n$ over any interval $I = [t^n_{i-1}, t^n_i]$ is obviously bounded by the length of the corresponding linear chord plus the length of the geodesic associated to $\exp(v/2^n) = \exp(v|I|)$, which is precisely equal to

$$\|\exp(v|I|)\| = |I|^{1/N} \|\exp(v)\| =: c_2 |I|^{1/N}.$$

4 A path $\xi : [a,b] \to \mathbb{R}^d$ is a geodesic associated to $g \in G^N(\mathbb{R}^d)$ if $S_N(\xi)_{a,b} = g$ and $\xi$ has length equal to the Carnot-Carathéodory norm of $g$. See [FV06b] and the references therein.

Obviously, nothing here is specific to Brownian motion. An application of corollary 1 gives the following convergence result which, when applied to Brownian motion and Lévy area and in conjunction with theorem 2 below, implies Sussmann's non-standard approximation result for stochastic differential equations.

Proposition 1 (Rough path convergence of Sussmann's approximation). Let $\mathbf{X}(\omega) \in C^{\alpha\text{-Höl}}([0,T], G^{[1/\alpha]}(\mathbb{R}^d))$ be a (random) $\alpha$-Hölder rough path which is the limit of its piecewise linear approximation (as detailed in corollary 1). Then, for any $\gamma < \min(\alpha, 1/N)$ we have

$$d_{\gamma\text{-Höl}}(S_N(X^n), S_N(\mathbf{X}) \otimes e^{v\cdot}) \to 0 \text{ in } L^q \text{ for all } q \in [1,\infty).$$

Strictly speaking, Sussmann's construction did not rely on the concept of geodesics associated to $\exp(v/2^n) \in G^N(\mathbb{R}^d)$. Since $\exp(v/2^n)$ is an element of the center of the group he can give a reasonably simple inductive construction of a piecewise linear "approximate geodesic" which is also seen to satisfy the assumptions of our theorem. We also note that he only discusses dyadic approximations and obtains an a.s. convergence result (which can be obtained by a direct application of theorem 1). In conjunction with theorem 2 below, we then recover the main result of [Sus91].
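
To make the scaling $\|\exp(v|I|)\| = |I|^{1/N}\|\exp(v)\|$ concrete, here is a minimal sketch for the simplest case $d = 2$, $N = 2$ only, where a pure area element can be realized by a circular loop (the classical isoperimetric picture). The helper names and the polygonal discretization with `m` vertices are assumptions of this illustration, not part of Sussmann's or the authors' construction.

```python
import math

def circle_loop(area, m=2000):
    """Closed polygon through the origin approximating a circle that encloses
    the signed area `area` (positive means counter-clockwise)."""
    r = math.sqrt(abs(area) / math.pi)
    sign = 1.0 if area >= 0 else -1.0
    pts = []
    for k in range(m + 1):
        th = 2.0 * math.pi * k / m
        pts.append((r * (1.0 - math.cos(th)), -sign * r * math.sin(th)))
    return pts

def signed_area(pts):
    """Shoelace formula: (1/2) * sum of x_k * y_{k+1} - x_{k+1} * y_k."""
    return 0.5 * sum(x0 * y1 - x1 * y0
                     for (x0, y0), (x1, y1) in zip(pts, pts[1:]))

def length(pts):
    """Euclidean length of the polygon."""
    return sum(math.hypot(x1 - x0, y1 - y0)
               for (x0, y0), (x1, y1) in zip(pts, pts[1:]))

v = 0.7   # area produced per unit time (hypothetical value)
h = 0.01  # length |I| of one dissection interval (hypothetical value)
loop = circle_loop(v * h)
```

The loop returns to its starting point, contributes the area $v|I|$, and has length $2\sqrt{\pi |v|\,|I|}$, i.e. of order $|I|^{1/2} = |I|^{1/N}$; this is the term $c_2 |I|^{\beta}$ with $\beta = 1/N$ in the length condition of remark 1.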

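2.2.2. McShane [McS72]. Given $x \in C([0,T], \mathbb{R}^2)$, an interpolation function $\phi = (\phi^1, \phi^2) \in C^1([0,1], \mathbb{R}^2)$ with $\phi(0) = (0,0)$ and $\phi(1) = (1,1)$ and a dissection $D = D_n = \{t_i\}$ of $[0,T]$ we define the McShane interpolation $x^n \in C([0,T], \mathbb{R}^2)$ by

$$x^{n;i}_t := x^i_{t_D} + x^i_{t_D,t^D} \, \phi^{\Delta(t,i)}\Big( \frac{t - t_D}{t^D - t_D} \Big), \quad i = 1, 2.$$

The points $t_D, t^D \in D$ denote the left-, resp. right-, neighbouring points of $t$ in the dissection and

$$\Delta(t,i) = \begin{cases} i & \text{if } x^1_{t_D,t^D} \, x^2_{t_D,t^D} \ge 0, \\ 3 - i & \text{if } x^1_{t_D,t^D} \, x^2_{t_D,t^D} < 0. \end{cases}$$

As a simple consequence of this definition, for $u < v$ in $[t_i, t_{i+1}]$,

$$S_2(x^n)_{u,v} = \exp(x^n_{u,v} + A^n_{u,v}) = \exp\Big( x^n_{u,v} + |x^1_{t_i,t_{i+1}}| \, |x^2_{t_i,t_{i+1}}| \, A^\phi\Big( \frac{u - t_i}{t_{i+1} - t_i}, \frac{v - t_i}{t_{i+1} - t_i} \Big) \Big)$$

where $A^\phi(u,v) \equiv A^\phi_{u,v}$ is the area increment of $\phi$ over $[u,v] \subset [0,1]$ (here and below the scalar area is identified with an element of $so(2)$).

Consider now $\mathbf{X}(\omega) = \mathbf{B}(\omega) = \exp(B + A) \in C^{\alpha\text{-Höl}}([0,T], G^{[1/\alpha]}(\mathbb{R}^2))$ with $\alpha \in (1/3, 1/2)$ and take any $(D_n)_{n\in\mathbb{N}}$ with $|D_n| \to 0$. (We know, e.g. from [FV07], that the lifted piecewise linear approximations $S_2(B^{D_n})$ converge to $\mathbf{B}$ in $\alpha$-Hölder rough path topology and in $L^q$ for all $q$.) It is easy to see (via Chen's theorem) that McShane's approximation to two-dimensional Brownian motion satisfies the assumptions of corollary 1 with $\beta = 1/2$, $N = 2$. Indeed, writing $S_2(B^n) = \exp(B^n + A^n)$ it is clear that for any $s < t$ in $[t_D, t^D]$

$$A^n_{s,t} = |B^1_{t_D,t^D}| \, |B^2_{t_D,t^D}| \, A^\phi\Big( \frac{s - t_D}{t^D - t_D}, \frac{t - t_D}{t^D - t_D} \Big) \quad \text{with } D = D_n,$$

and for $t_i < t_j$ elements of the dissection $D = D_n$,

$$P^n_{t_i,t_j} = \exp\Big( A^\phi_{0,1} \sum_{k=i}^{j-1} |B^1_{t_k,t_{k+1}}| \, |B^2_{t_k,t_{k+1}}| \Big).$$

It is straightforward to see that $\sum_{k=i}^{j-1} |B^1_{t_k,t_{k+1}}| \, |B^2_{t_k,t_{k+1}}|$ converges, in $L^2$ say, to its mean

$$\frac{2}{\pi} \sum_{k=i}^{j-1} (t_{k+1} - t_k) = \frac{2}{\pi} |t_j - t_i|,$$

while $| \|P^n_{t_i,t_j}\| |_{L^q} \le c_q |t_j - t_i|^{1/2}$ follows directly from $\|P^n_{t_i,t_j}\| \sim \big( \sum_{k=i}^{j-1} |B^1_{t_k,t_{k+1}}| \, |B^2_{t_k,t_{k+1}}| \big)^{1/2}$ and

$$\Big| \sum_{k=i}^{j-1} |B^1_{t_k,t_{k+1}}| \, |B^2_{t_k,t_{k+1}}| \Big|_{L^q} \le \sum_{k=i}^{j-1} \big| B^1_{t_k,t_{k+1}} \big|_{L^{2q}} \big| B^2_{t_k,t_{k+1}} \big|_{L^{2q}} \le c_q |t_j - t_i|.$$

In fact, $| \|P^n_{s,t}\| |_{L^q} \le c_q |t-s|^{1/2}$ for all $s,t$ since for $u < v$ in $[t_i, t_{i+1}]$

$$\Big| |A^n_{u,v}|^{1/2} \Big|_{L^q} = \Big| A^\phi\Big( \frac{u - t_i}{t_{i+1} - t_i}, \frac{v - t_i}{t_{i+1} - t_i} \Big) \Big|^{1/2} \, \mathbb{E}\Big( |B^1_{t_i,t_{i+1}}|^{q/2} \, |B^2_{t_i,t_{i+1}}|^{q/2} \Big)^{1/q} \le c_q |v - u|^{1/2}.$$

Finally, we easily see that, for any $t_i \in D_n$,

$$|B^n|_{1\text{-Höl};[t_i,t_{i+1}]} \le |\phi'|_\infty \, |B^{D_n}|_{1\text{-Höl};[t_i,t_{i+1}]}.$$

This shows that all assumptions of the above theorem are satisfied and so we have

Proposition 2 (Rough path convergence of McShane's approximation). For all $\alpha \in [0, 1/2)$,

$$d_{\alpha\text{-Höl}}(S_2(B^n), \exp(B_t + A_t + t\Gamma)) \to 0 \text{ in } L^q \text{ for all } q \in [1,\infty)$$

where $A_t$ is the usual $so(2)$-valued Lévy's area and

$$\Gamma = \begin{pmatrix} 0 & \frac{2}{\pi} A^\phi_{0,1} \\ -\frac{2}{\pi} A^\phi_{0,1} & 0 \end{pmatrix} \in so(2).$$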

We note that corollary 1 also applies to a (minor) variation of the McShane example given in [LL02].
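
The constant $2/\pi$ in the McShane example is the mean $\mathbb{E}|Z_1||Z_2|$ for independent standard Gaussians, so that $\sum_k |B^1_{t_k,t_{k+1}}||B^2_{t_k,t_{k+1}}|$ concentrates around $(2/\pi)|t_j - t_i|$. The following Monte Carlo sketch checks this on a uniform dissection of $[0,1]$; the function name, the seed and the number of intervals are choices made here for illustration only.

```python
import math
import random

def mcshane_area_sum(n, T=1.0, seed=0):
    """Sum over a uniform dissection of [0, T] with n intervals of
    |B^1 increment| * |B^2 increment| for a simulated 2d Brownian motion."""
    rng = random.Random(seed)
    sd = math.sqrt(T / n)  # standard deviation of each Brownian increment
    total = 0.0
    for _ in range(n):
        total += abs(rng.gauss(0.0, sd)) * abs(rng.gauss(0.0, sd))
    return total

estimate = mcshane_area_sum(20000)
limit = 2.0 / math.pi  # (2/pi) * T with T = 1
```

With 20000 intervals the standard deviation of the sum is of order $5 \cdot 10^{-3}$, so `estimate` should sit close to `limit`; multiplying by $A^\phi_{0,1}$ gives the off-diagonal entry of $\Gamma$ per unit time.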
3. RDEs with Drift: The Doss-Sussmann Approach

3.1. Preliminaries on ODEs and flows. Consider an ordinary differential equation (ODE), driven by a smooth $\mathbb{R}^d$-valued signal $f(t) = (f^1(t), \dots, f^d(t))^T$ on a time interval $[t_0, T]$ along sufficiently smooth and bounded vector fields $V = (V_1, \dots, V_d)$ and a drift vector field $V_0$,

$$dy = V_0(y)\,dt + V(y)\,df, \quad y(t_0) = y_0 \in \mathbb{R}^e.$$

We also call $U^f_{t \leftarrow t_0}(y_0) \equiv y_t$ the associated flow. Let $J$ denote the Jacobian of $U$. It satisfies the ODE obtained by formal differentiation w.r.t. $y_0$. More specifically,

$$a \mapsto \frac{d}{d\varepsilon} U^f_{t \leftarrow t_0}(y_0 + \varepsilon a) \Big|_{\varepsilon = 0}$$

is a linear map from $\mathbb{R}^e \to \mathbb{R}^e$ and we let $J^f_{t \leftarrow t_0}(y_0)$ denote the corresponding $e \times e$ matrix. It is immediate to see that

$$(3.1) \qquad \frac{d}{dt} J^f_{t \leftarrow t_0}(y_0) = \Big[ \frac{d}{dt} M^f\big( U^f_{t \leftarrow t_0}(y_0), t \big) \Big] \cdot J^f_{t \leftarrow t_0}(y_0)$$

where $\cdot$ denotes matrix multiplication and

$$\frac{d}{dt} M^f(y, t) = \sum_{i=1}^d V_i'(y) \frac{d}{dt} f^i_t + V_0'(y).$$

Note that also $J^f_{t_2 \leftarrow t_0} = J^f_{t_2 \leftarrow t_1} \cdot J^f_{t_1 \leftarrow t_0}$ and that $J^f_{t \leftarrow t_0}$ is invertible with inverse, denoted $J^f_{t_0 \leftarrow t}$, given as the flow of (3.1) with $f$ replaced by $f^t(\cdot) = f(t - \cdot)$, i.e.

$$J^f_{t_0 \leftarrow t}(\cdot) = \big( J^f_{t \leftarrow t_0}(\cdot) \big)^{-1}.$$

Now for $V = (V_1, \dots, V_d) \in \mathrm{Lip}^2(\mathbb{R}^e)$ and $x \in C^1([0,T], \mathbb{R}^d)$ ODE theory tells us that $dy = V(y)\,dx_t$ has a $C^2$-flow. Note that the flow

$$y_0 \mapsto \pi_{(V)}(0, y_0; x)_t = U^x_{t \leftarrow 0}(y_0)$$

is even globally Lipschitz (thanks to the boundedness which is part of the Lip-definition). The associated Jacobian $J^x_{t \leftarrow 0}(\cdot)$ is itself $C^1$ and also globally Lipschitz in $y_0$, as is its inverse $J^x_{0 \leftarrow t}(\cdot)$. This is more than enough to see that for $V_0 \in \mathrm{Lip}^1$

$$\big( J^x_{0 \leftarrow t}(\cdot) \cdot V_0(\cdot) \big) \circ \pi_{(V)}(0, \cdot\,; x)_t$$

is Lipschitz (in $\cdot$), uniformly in $t$ in $[0,T]$. Obviously, the above expression, as a function of $(\cdot, t)$, is also continuous in $t$. It follows that

$$\dot z_t = J^x_{0 \leftarrow t}(z_t) \cdot V_0\big( \pi_{(V)}(0, z_t; x)_t \big)$$

has a unique solution started from $z(0) = y_0$. An elementary computation shows that $y(t) = \pi_{(V)}(0, z_t; x)_t$ satisfies

$$dy = V_0(y)\,dt + V(y)\,dx.$$

This is the Doss-Sussmann method (cf. [RW00, p. 180]) applied in a simple ODE context.
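
The Doss-Sussmann reduction above can be checked numerically in a scalar toy case. The sketch below assumes $e = d = 1$, the linear field $V(y) = y$ (unbounded, hence outside the Lip class of the text, but chosen because its flow $U_t(z) = z\,e^{x_t - x_0}$ and Jacobian are explicit), the drift $V_0 = \cos$ and the smooth driver $x_t = \sin(3t)$; all of these, and the function names, are illustrative choices rather than anything prescribed by the paper.

```python
import math

def rk4(f, y0, t0, t1, n):
    """Classical fourth-order Runge-Kutta for the scalar ODE y' = f(t, y)."""
    h = (t1 - t0) / n
    t, y = t0, y0
    for _ in range(n):
        k1 = f(t, y)
        k2 = f(t + h / 2, y + h * k1 / 2)
        k3 = f(t + h / 2, y + h * k2 / 2)
        k4 = f(t + h, y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return y

def x(t):
    return math.sin(3.0 * t)        # smooth driving signal (assumed)

def dx(t):
    return 3.0 * math.cos(3.0 * t)  # its derivative

def V0(y):
    return math.cos(y)              # drift vector field (assumed)

def solve_direct(y0, T, n=2000):
    """Solve dy = V0(y) dt + V(y) dx with V(y) = y directly."""
    return rk4(lambda t, y: V0(y) + y * dx(t), y0, 0.0, T, n)

def solve_doss_sussmann(y0, T, n=2000):
    """Solve z' = J_{0<-t} * V0(U_t(z)) and return y_T = U_T(z_T)."""
    def flow(t, z):                 # U_t(z) for dy = y dx
        return z * math.exp(x(t) - x(0.0))
    def inv_jacobian(t):            # inverse of dU_t/dz
        return math.exp(-(x(t) - x(0.0)))
    zT = rk4(lambda t, z: inv_jacobian(t) * V0(flow(t, z)), y0, 0.0, T, n)
    return flow(T, zT)
```

Both routes solve the same equation, so `solve_direct(0.5, 1.0)` and `solve_doss_sussmann(0.5, 1.0)` should agree up to discretization error; the second route never differentiates the driver, which is what makes the method usable for rough signals.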
3.2. Doss-Sussmann method for RDEs. We return to the discussion of section 3.1 and define solutions of RDEs with drift as uniform limits of solutions of ODEs with drift.

Definition 1. Let $\mathbf{x} \in C^{\alpha\text{-Höl}}([0,T], G^{[1/\alpha]}(\mathbb{R}^d))$. If there exists a sequence $(x^n)_{n\in\mathbb{N}}$ of Lipschitz paths with uniform $\alpha$-Hölder bounds converging pointwise to $\mathbf{x}$ (that is, $\mathbf{x}^n_t \equiv S_{[1/\alpha]}(x^n)_t \to \mathbf{x}_t$ for every $t \in [0,T]$ and $\sup_{n\in\mathbb{N}} \|S_{[1/\alpha]}(x^n)\|_{\alpha\text{-Höl}} < \infty$) such that for each $n \in \mathbb{N}$ the RDE (in fact ODE) with drift

$$(3.2) \qquad dy^n = V_0(y^n)\,dt + V(y^n)\,dx^n, \quad y^n(0) = y_0$$

has a unique solution $y^n$ on $[0,T]$, then we call any limit point in uniform topology of $\{y^n, n \in \mathbb{N}\}$ a solution of the RDE with drift

$$dy = V_0(y)\,dt + V(y)\,d\mathbf{x}, \quad y(0) = y_0$$

and we also write $y = \pi_{(V_0,V)}(0, y_0; (\mathbf{x}, t))$ for this solution.

Proposition 3 (Doss-Sussmann for RDE). Assume that

(i) $\mathbf{x} \in C^{\alpha\text{-Höl}}([0,T], G^{[1/\alpha]}(\mathbb{R}^d))$, $\alpha \in (0,1/2)$,

(ii) $V_0 \in \mathrm{Lip}^1(\mathbb{R}^e)$ and $V = (V_1, \dots, V_d) \in \mathrm{Lip}^{\gamma+1}(\mathbb{R}^e)$ for a $\gamma > 1/\alpha$,

(iii) $y_0 \in \mathbb{R}^e$.

Then there exists a unique solution $y$ to the RDE with drift

$$(3.3) \qquad dy = V_0(y)\,dt + V(y)\,d\mathbf{x}, \quad y(0) = y_0.$$

Further, this solution is $\alpha$-Hölder continuous and given as

$$(3.4) \qquad y(t) = U^{\mathbf{x}}_{t \leftarrow 0}(z_t),$$

$$(3.5) \qquad \text{with } \dot z_t = W(t, z_t), \quad z(0) = y_0,$$

where $W(t, \cdot) = J^{\mathbf{x}}_{0 \leftarrow t} \cdot V_0(\pi_{(V)}(0, \cdot\,; \mathbf{x})_t)$, $U^{\mathbf{x}}_{t \leftarrow 0}(\cdot) = \pi_{(V)}(0, \cdot\,; \mathbf{x})_t$ is the flow of

$$dy = V(y)\,d\mathbf{x}, \quad y(0) = y_0$$

and $J^{\mathbf{x}}_{0 \leftarrow t} = (D\pi_{(V)}(0, \cdot\,; \mathbf{x})_t)^{-1}$ is the inverse of the Jacobian of $U^{\mathbf{x}}_{t \leftarrow 0}$.

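Proof. We first show that (3.4) has a unique solution. Existence and uniqueness of $\pi_{(V)}(0, \cdot\,; \mathbf{x})$ (and of a $C^2$ flow) follows from standard rough path theory, so this boils down to checking that the ODE (3.5) has a unique solution on $[0,T]$. However, the vector field $W(t,y)$ is continuous in $t$ and $y$, and $W(t,\cdot)$ is Lipschitz continuous in $\cdot$, uniformly in $t$:

$$\begin{aligned}
|W(t,y) - W(t,x)| &\le \big| J^{\mathbf{x}}_{0\leftarrow t}(y) \cdot V_0(\pi_{(V)}(0,y;\mathbf{x})_t) - J^{\mathbf{x}}_{0\leftarrow t}(y) \cdot V_0(\pi_{(V)}(0,x;\mathbf{x})_t) \big| \\
&\quad + \big| J^{\mathbf{x}}_{0\leftarrow t}(y) \cdot V_0(\pi_{(V)}(0,x;\mathbf{x})_t) - J^{\mathbf{x}}_{0\leftarrow t}(x) \cdot V_0(\pi_{(V)}(0,x;\mathbf{x})_t) \big| \\
&\le |J^{\mathbf{x}}_{0\leftarrow t}|_\infty \, |V_0|_{\mathrm{Lip}^1} \, |U^{\mathbf{x}}_{t\leftarrow 0}|_{\mathrm{Lip}} \, |x - y| + |J^{\mathbf{x}}_{0\leftarrow t}|_{\mathrm{Lip}} \, |x - y| \, |V_0|_\infty.
\end{aligned}$$

Thanks to the invariance of the Lipschitz norms, $|V(\cdot + y)|_{\mathrm{Lip}^{\gamma+1}} = |V(\cdot)|_{\mathrm{Lip}^{\gamma+1}}$, uniform estimates follow from a routine exercise showing that $|U^{\mathbf{x}}_{t\leftarrow 0}|_{\mathrm{Lip}}$, $\sup_y |J^{\mathbf{x}}_{0\leftarrow t}(y)|$ and $|J^{\mathbf{x}}_{0\leftarrow t}|_{\mathrm{Lip}}$ are all bounded by a constant $c = c(\alpha, \gamma, |V|_{\mathrm{Lip}^{\gamma+1}}, \|\mathbf{x}\|_{\alpha\text{-Höl};[0,T]})$, uniformly in $t$. The desired Lipschitz regularity of $W$ follows, which implies existence of a unique solution $z$ on $[0,T]$ of (3.5).

To see that the path $y_t = U^{\mathbf{x}}_{t\leftarrow 0}(z_t)$ is the unique RDE solution to (3.3), let $(x^n)_{n\ge 1}$ be a sequence of Lipschitz paths with uniform $\alpha$-Hölder bounds converging pointwise to $\mathbf{x}$. For brevity set $\mathbf{x}^n = S_{[1/\alpha]}(x^n)$. Following our discussion in section 3.1, the solutions $y^n = \pi_{(V_0,V)}(0, y_0; (\mathbf{x}^n, t))$ are given by solving (3.4) and (3.5) where $\mathbf{x}$ is replaced by $\mathbf{x}^n$ (the same reasoning as in the first part of this proof gives existence and uniqueness of solutions $z^n$).

Note that by the universal limit theorem the map $(y, \mathbf{x}) \mapsto \pi_{(V)}(0, y; \mathbf{x})$ is uniformly continuous on bounded sets (see footnote 5), so if we can show uniform convergence of $z^n$ to $z$ the desired conclusion follows. A standard ODE estimate (a simple consequence of Gronwall) is

$$\sup_{t\in[0,T]} |z^n_t - z_t| \le \frac{M^n}{L} \big( e^{LT} - 1 \big)$$

where

$$M^n = \sup_{t\in[0,T],\, y} |W^n(t,y) - W(t,y)| \quad \text{and} \quad L = \sup_{t\in[0,T],\, y \ne \tilde y} \frac{|W(t,y) - W(t,\tilde y)|}{|y - \tilde y|}.$$

From the first part of this proof, $L < \infty$, and to see $M^n \to 0$ as $n \to \infty$, recall that $J^{\mathbf{x}^n}_{0\leftarrow t} \to J^{\mathbf{x}}_{0\leftarrow t}$ and $V_0(\pi_{(V)}(0,y;\mathbf{x}^n)_t) \to V_0(\pi_{(V)}(0,y;\mathbf{x})_t)$ uniformly in $y,t$ as $n \to \infty$ by the universal limit theorem (see footnote 6).

Finally, $\alpha$-Hölder continuity of the solution follows from the estimate

$$(3.6) \qquad \begin{aligned}
|\pi_{(V)}(0,z_t;\mathbf{x})_t - \pi_{(V)}(0,z_s;\mathbf{x})_s| &\le |\pi_{(V)}(0,z_t;\mathbf{x})_t - \pi_{(V)}(0,z_t;\mathbf{x})_s| \\
&\quad + |\pi_{(V)}(0,z_t;\mathbf{x})_s - \pi_{(V)}(0,z_s;\mathbf{x})_s| \\
&\le c_1 |t-s|^{\alpha} + |U^{\mathbf{x}}_{s\leftarrow 0}|_{\mathrm{Lip}} \, |z_t - z_s|,
\end{aligned}$$

where $c_1 = c_1(\alpha, \gamma, |V|_{\mathrm{Lip}^{\gamma+1}}, \|\mathbf{x}\|_{\alpha\text{-Höl};[0,T]})$, the Lipschitz constant of the flow is bounded in terms of a constant $c_2 = c_2(\alpha,\gamma)$, $|V|_{\mathrm{Lip}^{\gamma+1}}$ and $\|\mathbf{x}\|_{\alpha\text{-Höl}}$, and $z$ is Lipschitz in $t$ because $W$ is bounded.

□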



Remark 2. The above proof can be adapted to show uniqueness and (global) existence of RDEs with linear drift term, i.e.

$$dy = Ay\,dt + V(y)\,d\mathbf{x}, \quad y(0) = y_0,$$

$A$ an $(e \times e)$-matrix, same assumptions as above on $V$ and $\mathbf{x}$.

Corollary 2. Let $\mathbf{x}$, $V_0$ and $V$ be as in Proposition 3 and let $\mathbf{x}^n$ be a sequence of weak geometric $\alpha$-Hölder rough paths converging with uniform bounds to $\mathbf{x}$ in supremum norm, i.e.

$$\sup_n \|\mathbf{x}^n\|_{\alpha\text{-Höl}} < \infty, \qquad d_\infty(\mathbf{x}^n, \mathbf{x}) \to 0 \text{ as } n \to \infty.$$

Denote by $y$ and $y^n$ the corresponding solutions of the RDE with drift (3.3). Then

$$\sup_n |y^n|_{\alpha\text{-Höl}} < \infty, \qquad \sup_{t\in[0,T]} |y_t - y^n_t| \to 0 \text{ as } n \to \infty$$

and, by interpolation, for every $\alpha' < \alpha$

$$|y - y^n|_{\alpha'\text{-Höl}} \to 0 \text{ as } n \to \infty.$$

5 $(y, \mathbf{x})$ seen as an element in a product space of two metric spaces, i.e. with metric $\Delta((y, \mathbf{x}), (\tilde y, \tilde{\mathbf{x}})) = |y - \tilde y| + d_{\alpha\text{-Höl}}(\mathbf{x}, \tilde{\mathbf{x}})$.
6 For the convergence of the Jacobian a localisation argument is actually needed.

Proof. The uniform Hölder bounds follow as in (3.6) by noting that the $z^n$ are uniformly bounded. For every $\mathbf{x}^n$ there exists a sequence of Lipschitz paths $(x^{n,m})_m$ with $S_{[1/\alpha]}(x^{n,m})$ converging to $\mathbf{x}^n$ in $d_\infty$, with uniform (in $m$) Hölder bounds and with an associated sequence of RDE solutions $y^{n,m}$ converging to $y^n$. One can choose a subsequence $m_n$ such that $S_{[1/\alpha]}(x^{n,m_n})$ converges to $\mathbf{x}$ with uniform Hölder bounds and $\sup_t |y^{n,m_n}_t - y^n_t| \to 0$. Hence,

$$\sup_t |y_t - y^n_t| \le \sup_t |y_t - y^{n,m_n}_t| + \sup_t |y^{n,m_n}_t - y^n_t| \to 0 \text{ as } n \to \infty.$$

□



Remark 3. The auxiliary differential equation for $z$ can be written as

$$(3.7) \qquad dz = \big( J^{\mathbf{x}}_{0\leftarrow t} \cdot W \big) \circ \pi_{(V)}(0, z_t, \mathbf{x})_t \, dh$$

where $W = V_0$ and $h(t) = t$. In fact, one can take $W = (W_1, \dots, W_{d'})$, $h \in C^{q\text{-var}}([0,T], \mathbb{R}^{d'})$, in which case $\pi_{(V)}(0, z_t, \mathbf{x})_t$ solves

$$dy = V(y)\,d\mathbf{x} + W(y)\,dh.$$

We could make sense of (3.7) as a Young-integral equation as long as $1/p + 1/q > 1$ and thus obtain RDEs with drift-vector fields driven by $h$. In this case the pair $(\mathbf{x}, h)$ gives rise to a rough path (the cross-integrals of $\mathbf{x}$ and $h$ are well-defined Young-integrals); the advantage of the present consideration would be to reduce the regularity assumptions on $W$.

3.3. Euler scheme for RDEs with drift. We recall the Euler scheme for RDEs, cf. [FV08a].

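Definition 2. Let $N \in \mathbb{N}$. Given $\mathrm{Lip}^N$ vector fields $V = (V_j)_{j=1,\dots,d}$ on $\mathbb{R}^e$, $g \in G^N(\mathbb{R}^d)$, $y \in \mathbb{R}^e$. We call

$$\mathcal{E}^N_{(V)}(y, g) := \sum_{k=1}^{N} \; \sum_{i_1,\dots,i_k \in \{1,\dots,d\}} g^{k;i_1,\dots,i_k} \, V_{i_1} \cdots V_{i_k} I(y)$$

the step-$N$ Euler scheme ($I$ denotes the identity map).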
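
For intuition, the step-2 Euler scheme can be written out in the scalar case $d = e = 1$ with a smooth driver, where the group element over an interval with increment $a$ is $(a, a^2/2)$, $V I = V$ and $V V I = V' V$. The sketch below is an illustration under these assumptions; the vector field $V = \sin$ is a choice made here because the flow of $y' = \sin y$ is explicit, $\tan(y_t/2) = \tan(y_0/2)\,e^{t}$.

```python
import math

def euler_step2(V, dV, y, a):
    """Step-2 Euler increment for dy = V(y) dx over an interval with dx = a:
    a * V(y) + (a^2 / 2) * V'(y) * V(y)."""
    return a * V(y) + 0.5 * a * a * dV(y) * V(y)

def exact_increment(y0, a):
    """Exact increment of the flow of y' = sin(y) run for 'time' a."""
    return 2.0 * math.atan(math.tan(y0 / 2.0) * math.exp(a)) - y0

y0 = 1.0
err_coarse = abs(euler_step2(math.sin, math.cos, y0, 0.10) - exact_increment(y0, 0.10))
err_fine = abs(euler_step2(math.sin, math.cos, y0, 0.05) - exact_increment(y0, 0.05))
```

Halving the increment reduces the local error by roughly a factor $2^3$, the smooth-driver analogue of the exponent $\theta = (N+1)\alpha$ with $N = 2$ and $\alpha = 1$.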

Proposition 4 (Euler-estimate for RDEs with drift). Let $N \in \mathbb{N}$, $N \ge 1/\alpha$. For $\mathbf{x} \in C^{\alpha\text{-Höl}}([0,T], G^N(\mathbb{R}^d))$, $V_0 \in \mathrm{Lip}^1$, $V = (V_i)_{i=1,\dots,d} \in \mathrm{Lip}^{\gamma+1}(\mathbb{R}^e)$, $\gamma > N$,

$$\big| \pi_{(V_0,V)}(s, y_s; (\mathbf{x},t))_{s,t} - \mathcal{E}^N_{(V)}(y_s, \mathbf{x}_{s,t}) - V_0(y_s)(t-s) \big| \le c |t-s|^{\theta}$$

where $\theta \ge 1 + \alpha > 1$ and $c = c(\alpha, N, \|\mathbf{x}\|_{\alpha\text{-Höl}}, |V|_{\mathrm{Lip}^N}, |V_0|_{\mathrm{Lip}^1})$. Here $y_s \in \mathbb{R}^e$ is a fixed "starting" point.

Proof. This is an error estimate for RDEs with drift over the time-interval $[s,t]$. By shifting time, we may consider w.l.o.g. $s = 0$, and from [FV08a] we know that

$$\big| \pi_{(V)}(0, y_0; \mathbf{x})_{0,t} - \mathcal{E}^N_{(V)}(y_0, \mathbf{x}_{0,t}) \big| \le c_1 t^{\theta}$$

where $c_1 = c_1(\alpha, N, \|\mathbf{x}\|_{\alpha\text{-Höl}}, |V|_{\mathrm{Lip}^N})$, $\theta = (N+1)\alpha \ge 1 + \alpha$. By the triangle inequality and our definition of "RDE with drift" it then suffices to show that

$$\big| \pi_{(V_0,V)}(0, y_0; (\mathbf{x},t))_{0,t} - \pi_{(V)}(0, y_0; \mathbf{x})_{0,t} - V_0(y_0)\, t \big| \lesssim t^{1+\alpha}.$$

To see this, recall

$$\pi_{(V_0,V)}(0, y_0; (\mathbf{x},t))_t = \pi_{(V)}(0, z_t; \mathbf{x})_t$$

where $z_{0,t} = z_t - z_0 = z_t - y_0 = V_0(y_0)\, t + O(t^2)$. We then have

$$\begin{aligned}
\pi_{(V)}(0, z_t; \mathbf{x})_t - \pi_{(V)}(0, y_0; \mathbf{x})_t &= \int_0^t J^{\mathbf{x}}_{t\leftarrow 0}(z_s)\, dz_s \\
&= \int_0^t dz_s + \int_0^t \big( J^{\mathbf{x}}_{t\leftarrow 0}(z_s) - I \big)\, dz_s \\
&= V_0(y_0)\, t + O(t^{1+\alpha})
\end{aligned}$$

and the proof is finished. □

4. Applications 

4.1. Drift vector fields induced by perturbed driving signals. We now show 
that perturbations of a rough driving signal are picked up by the RDE as a drift 
term of iterated Lie brackets of the vector fields. Since the RDE solution is a 
continuous function of the driving signal, we also have continuity under convergence 
in probability (in suitable Holder rough path metrics) of random rough driving 
signals. Combined with the results of section 2 we arrive at a general criterion for
non-standard convergence in stochastic differential equations, more general than
[IW89], [GM04] in the sense that our result applies to perturbations on all levels,
exhibiting additional drift terms involving any iterated Lie brackets of the driving
vector fields. In particular, the examples given in section 2.2 allow us to recover
the convergence results of McShane [McS72] and Sussmann [Sus91]. (In fact, as a free
benefit of the rough path approach, the respective convergence results will take
place at the level of stochastic flows.)

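Theorem 2. Let $\mathbf{x} : [0,T] \to G^{[1/\alpha]}(\mathbb{R}^d)$ be a weak geometric $\alpha$-Hölder rough path, fix $v = (v^{i_1,\dots,i_N})_{i_1,\dots,i_N = 1,\dots,d} \in g^N(\mathbb{R}^d) \cap (\mathbb{R}^d)^{\otimes N}$, $N \ge 2$, and set

$$\tilde{\mathbf{x}}_{s,t} = \exp\big( \log S_N(\mathbf{x})_{s,t} + v\,(t-s) \big) \quad \text{for } s,t \in [0,T].$$

(This defines a weak geometric $\min(\alpha, 1/N)$-Hölder rough path in $G^N(\mathbb{R}^d)$ with identical projection as $\mathbf{x}$ to $\mathbb{R}^d$.) Further, assume $V_0 \in \mathrm{Lip}^1$, $V = (V_i)_{i=1,\dots,d} \in \mathrm{Lip}^{\gamma+1}$, $\gamma > \max(1/\alpha, N)$, vector fields on $\mathbb{R}^e$. Then the unique RDE solution of

$$(4.1) \qquad dy = V_0(y)\,dt + V(y)\,d\tilde{\mathbf{x}}, \quad y(0) = y_0$$

coincides with the unique RDE solution of

$$(4.2) \qquad dz = (V_0(z) + W(z))\,dt + V(z)\,d\mathbf{x}, \quad z(0) = y_0$$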
where 



v 



l,-,l[l/a] 



We prepare the proof with

Lemma 1. Let $k \in \mathbb{N}$. Given a multi-index $a = (a_1, \dots, a_k) \in \{1,\dots,d\}^k$ and $\mathrm{Lip}^k$ vector fields $V_1, \dots, V_d$ on $\mathbb{R}^e$, define

$$V_a = [V_{a_k}, [V_{a_{k-1}}, \dots, [V_{a_2}, V_{a_1}]]].$$

Further let $e_1, \dots, e_d$ denote the canonical basis of $\mathbb{R}^d$. Then $g^n(\mathbb{R}^d)$, the step-$n$ free Lie algebra, is generated by elements of the form

$$e_a = [e_{a_k}, [e_{a_{k-1}}, \dots, [e_{a_2}, e_{a_1}]]] \in (\mathbb{R}^d)^{\otimes k}, \quad k \le n,$$

with $[e_i, e_j] = e_i \otimes e_j - e_j \otimes e_i$ and

$$\sum_{i_1,\dots,i_k \in \{1,\dots,d\}} V_{i_k} \cdots V_{i_1} \, (e_a)^{i_k,\dots,i_1} = V_a.$$

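Proof. It is clear that $\mathfrak{g}^n(\mathbb{R}^d)$ is generated by the $e_\alpha$. We prove the second statement
by induction: a straightforward calculation shows that it holds for $k = 2$.
Now suppose it holds for $k-1$ and denote $\tilde{V}_\alpha = [V_{\alpha_{k-1}}, \dots, [V_{\alpha_2}, V_{\alpha_1}]]$. Then (using
summation convention)

$$\begin{aligned}
V_{i_k} \cdots V_{i_1} (e_\alpha)^{i_k,\dots,i_1}
&= V_{i_k} \cdots V_{i_1} \left( e_{\alpha_k} \otimes [e_{\alpha_{k-1}}, \dots, [e_{\alpha_2}, e_{\alpha_1}]] \right)^{i_k,\dots,i_1} \\
&\quad - V_{i_k} \cdots V_{i_1} \left( [e_{\alpha_{k-1}}, \dots, [e_{\alpha_2}, e_{\alpha_1}]] \otimes e_{\alpha_k} \right)^{i_k,\dots,i_1} \\
&= V_{\alpha_k}\, V_{i_{k-1}} \cdots V_{i_1} [e_{\alpha_{k-1}}, \dots, [e_{\alpha_2}, e_{\alpha_1}]]^{i_{k-1},\dots,i_1} \\
&\quad - V_{i_k} \cdots V_{i_2} [e_{\alpha_{k-1}}, \dots, [e_{\alpha_2}, e_{\alpha_1}]]^{i_k,\dots,i_2}\, V_{\alpha_k} \\
&= [V_{\alpha_k}, [V_{\alpha_{k-1}}, \dots, [V_{\alpha_2}, V_{\alpha_1}]]]
\end{aligned}$$

(where we used the induction hypothesis that

$$V_{i_{k-1}} \cdots V_{i_1} [e_{\alpha_{k-1}}, \dots, [e_{\alpha_2}, e_{\alpha_1}]]^{i_{k-1},\dots,i_1} = V_{i_k} \cdots V_{i_2} [e_{\alpha_{k-1}}, \dots, [e_{\alpha_2}, e_{\alpha_1}]]^{i_k,\dots,i_2} = \tilde{V}_\alpha).$$

□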
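
The identity of Lemma 1 is purely algebraic, so it can be sanity-checked in any associative algebra. The following is a minimal sketch under an assumption that is not in the text: the vector fields $V_i$ are replaced by square integer matrices `X[i]` composed by matrix product. The helper names (`tensor_bracket`, `e_alpha`, `lhs`, `rhs`) and the sample matrices are hypothetical and chosen only for illustration.

```python
# Tensors over R^d are stored as dicts {word: coefficient}; the word
# (i_k, ..., i_1) stands for e_{i_k} (x) ... (x) e_{i_1}.

def tensor_bracket(s, t):
    """Commutator s (x) t - t (x) s of two tensors."""
    out = {}
    for w1, c1 in s.items():
        for w2, c2 in t.items():
            out[w1 + w2] = out.get(w1 + w2, 0) + c1 * c2
            out[w2 + w1] = out.get(w2 + w1, 0) - c1 * c2
    return {w: c for w, c in out.items() if c != 0}

def e_alpha(alpha):
    """e_alpha = [e_{a_k}, [..., [e_{a_2}, e_{a_1}]]] for alpha = (a_1, ..., a_k)."""
    result = {(alpha[0],): 1}
    for a in alpha[1:]:
        result = tensor_bracket({(a,): 1}, result)
    return result

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(X, Y):
    XY, YX = matmul(X, Y), matmul(Y, X)
    n = len(X)
    return [[XY[i][j] - YX[i][j] for j in range(n)] for i in range(n)]

def lhs(alpha, X):
    """Sum over words of X_{i_k} ... X_{i_1} (e_alpha)^{i_k,...,i_1}."""
    n = len(next(iter(X.values())))
    total = [[0] * n for _ in range(n)]
    for word, coeff in e_alpha(alpha).items():
        prod = [[int(i == j) for j in range(n)] for i in range(n)]
        for i in word:  # multiply in the order in which the word is written
            prod = matmul(prod, X[i])
        total = [[total[r][c] + coeff * prod[r][c] for c in range(n)]
                 for r in range(n)]
    return total

def rhs(alpha, X):
    """Nested commutator [X_{a_k}, [..., [X_{a_2}, X_{a_1}]]]."""
    result = X[alpha[0]]
    for a in alpha[1:]:
        result = commutator(X[a], result)
    return result

# Hypothetical non-commuting stand-ins for V_1, V_2, V_3 (d = 3).
X = {1: [[0, 1], [0, 0]], 2: [[0, 0], [1, 0]], 3: [[1, 2], [3, 4]]}
print(lhs((1, 2, 3, 1), X) == rhs((1, 2, 3, 1), X))
```

The check works because sending each word to the corresponding matrix product is multiplicative, so tensor commutators are mapped to matrix commutators; it does not test anything specific to differential operators.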



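Proof of Theorem 2. By construction $W \in \mathrm{Lip}^1$, and existence and uniqueness of the
RDE solutions $y, z$ to (4.1), (4.2) follow from proposition [3]. We have to show
that $y_t = z_t$ for every $t \in [0,T]$. Therefore fix $\tau \in [0,T]$, take a dissection
$D = (t_i)_{i=0}^{|D|}$ of $[0,\tau]$ with $t_0 = 0$ and $t_{|D|} = \tau$, and define

$$z^i_s = \pi_{(V_0+W,V)}\left(t_i, y_{t_i}; (\mathbf{x},t)\right)_s \quad \text{for } s \in [t_i,\tau] \subset [0,T], \quad i = 0,\dots,|D|.$$

Note that $z^0_\tau = z_\tau$ and $z^{|D|}_\tau = y_\tau$, hence

(4.3) $|z_\tau - y_\tau| \le \sum_{i=1}^{|D|} \left| z^i_\tau - z^{i-1}_\tau \right|.$
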

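Using the Lipschitz continuity of RDE flows we get

$$\begin{aligned}
\left| z^i_\tau - z^{i-1}_\tau \right|
&= \left| \pi_{(V_0+W,V)}\left(t_i, y_{t_i}; (\mathbf{x},t)\right)_\tau - \pi_{(V_0+W,V)}\left(t_{i-1}, y_{t_{i-1}}; (\mathbf{x},t)\right)_\tau \right| \\
&= \left| \pi_{(V_0+W,V)}\left(t_i, y_{t_i}; (\mathbf{x},t)\right)_\tau - \pi_{(V_0+W,V)}\left(t_i, \pi_{(V_0+W,V)}\left(t_{i-1}, y_{t_{i-1}}; (\mathbf{x},t)\right)_{t_i}; (\mathbf{x},t)\right)_\tau \right| \\
&\le \left| \pi_{(V_0+W,V)}\left(\cdot,\cdot; (\mathbf{x},t)\right) \right|_{\mathrm{Lip}} \left| y_{t_i} - \pi_{(V_0+W,V)}\left(t_{i-1}, y_{t_{i-1}}; (\mathbf{x},t)\right)_{t_i} \right|.
\end{aligned}$$
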

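For brevity set $\alpha^* = \min(\alpha, 1/N)$. By adding and subtracting
$\mathcal{E}_{(V)}^{[1/\alpha^*]}\left(y_{t_{i-1}}, \tilde{\mathbf{x}}_{t_{i-1},t_i}\right) + V_0(y_{t_{i-1}})\,|t_i - t_{i-1}|$ and splitting up we estimate

$$\left| y_{t_i} - \pi_{(V_0+W,V)}\left(t_{i-1}, y_{t_{i-1}}; (\mathbf{x},t)\right)_{t_i} \right| \le (1) + (2)$$

with

$$(1) = \left| \pi_{(V_0,V)}\left(t_{i-1}, y_{t_{i-1}}; (\tilde{\mathbf{x}},t)\right)_{t_{i-1},t_i} - \mathcal{E}_{(V)}^{[1/\alpha^*]}\left(y_{t_{i-1}}, \tilde{\mathbf{x}}_{t_{i-1},t_i}\right) - V_0(y_{t_{i-1}})\,|t_i - t_{i-1}| \right|,$$

$$(2) = \left| \mathcal{E}_{(V)}^{[1/\alpha^*]}\left(y_{t_{i-1}}, \tilde{\mathbf{x}}_{t_{i-1},t_i}\right) + V_0(y_{t_{i-1}})\,|t_i - t_{i-1}| - \pi_{(V_0+W,V)}\left(t_{i-1}, y_{t_{i-1}}; (\mathbf{x},t)\right)_{t_{i-1},t_i} \right|.$$
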



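From proposition 21, $(1) \le c_1 |t_i - t_{i-1}|^\theta$ with $\theta > 1$, and by lemma 1

$$\mathcal{E}_{(V)}^{[1/\alpha^*]}\left(y_{t_{i-1}}, \tilde{\mathbf{x}}_{t_{i-1},t_i}\right) = \mathcal{E}_{(V)}^{[1/\alpha]}\left(y_{t_{i-1}}, \mathbf{x}_{t_{i-1},t_i}\right) + |t_i - t_{i-1}|\, W(y_{t_{i-1}}).$$

Again proposition [4] applies and $(2) \le c_2 |t_i - t_{i-1}|^\theta$. Plugging all this into (4.3) gives

$$|z_\tau - y_\tau| \le c_3 \sum_{i=1}^{|D|} |t_i - t_{i-1}|^\theta$$

with $c_3 = c_3\left(\alpha, N, \|\tilde{\mathbf{x}}\|_{\alpha^*\text{-Höl}}, \|\mathbf{x}\|_{\alpha\text{-Höl}}, |V|_{\mathrm{Lip}^{[1/\alpha^*]}}, |V_0|_{\mathrm{Lip}^1}, |W|_{\mathrm{Lip}^1}\right)$. Since $\theta > 1$,
the sum on the r.h.s. goes to $0$ as $|D| \to 0$, and this finishes the proof.

□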
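
The last step of the proof relies on the elementary fact that sums of the form $\sum_i |t_i - t_{i-1}|^\theta$ vanish as the mesh shrinks when $\theta > 1$. A minimal numerical sketch, assuming uniform dissections and a hypothetical exponent $\theta = 1.5$ (neither choice comes from the text; the function name `mesh_sum` is also only illustrative):

```python
def mesh_sum(tau, n, theta):
    """Sum of |t_i - t_{i-1}|**theta over the uniform dissection of [0, tau]
    into n intervals (uniform spacing is an assumption of this sketch)."""
    points = [tau * i / n for i in range(n + 1)]
    return sum((b - a) ** theta for a, b in zip(points, points[1:]))

# For a uniform dissection the sum equals tau**theta * n**(1 - theta),
# which decays like n**(-1/2) for theta = 1.5.
for n in (10, 100, 1000):
    print(n, mesh_sum(1.0, n, 1.5))
```

For $\theta = 1$ the same sum stays equal to the interval length, which is why the strict inequality $\theta > 1$ matters.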



4.2. Optimality of RDE estimates. At last, we use theorem 2 to establish
optimality of two important RDE estimates (proved in [Lyo98] and [FV08b]) for the
case of paths with $p$-variation, $p \in \mathbb{N}$. The second part of the following theorem
settles a question that was left open in [CFV08].

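Theorem 3. Let $\mathbf{x} : [0,T] \to G^p(\mathbb{R}^d)$ be a geometric $p$-rough path, $y_0 \in \mathbb{R}^e$, $p \in \mathbb{N}$.
If either

(1) $V = (V_i)_{i=1,\dots,d} \in \mathrm{Lip}^\gamma(\mathbb{R}^e)$, $\gamma > p$, or

(2) $V = (V_i)_{i=1,\dots,d}$ are linear vector fields on $\mathbb{R}^e$, i.e. $V_i(y) = A_i \cdot y$ with $A_i$ an $(e \times e)$-matrix,

then there exists a unique RDE solution to

$$dy = V(y)\,d\mathbf{x}, \quad y(0) = y_0.$$

Further, in case (1),

(4.4) $|y|_\infty \le C \max\left( \|\mathbf{x}\|_{p\text{-var};[s,t]}, \|\mathbf{x}\|^p_{p\text{-var};[s,t]} \right)$

with $C = C\left(y_0, p, |V|_{\mathrm{Lip}^{\gamma+1}}\right)$, and in case (2)

(4.5) $|y|_\infty \le c \exp\left( c\, \|\mathbf{x}\|^p_{p\text{-var}} \right)$

with $c = c\left(y_0, p, (|V_i|)_{i=1,\dots,d}\right)$ hold, and both estimates are optimal (i.e. the bounds are
attained).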



We prepare the proof with two lemmas 
Lemma 2. For all $k \in \mathbb{N}$ there exist $k$ smooth, bounded vector fields $V_1, \dots, V_k$ on
$\mathbb{R}^e$ such that

$$\underbrace{[V_k, \dots, [V_3, [V_2, V_1]] \dots]}_{k-1 \text{ brackets}} = \frac{\partial}{\partial x^e}.$$

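Proof. Set

$$V = \sin x^e\, \frac{\partial}{\partial x^e}, \quad W = -\cos x^e\, \frac{\partial}{\partial x^e}, \quad E = \frac{\partial}{\partial x^e}.$$

Note that $[V, W] = E$ and $[V, E] = W$. Hence, counting $k$ vector fields and $k-1$ brackets,

for $k$ even: $[V, \dots, [V, [V, W]] \dots] = E$,

for $k$ odd: $[V, \dots, [V, [V, E]] \dots] = E$.

□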
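
The two bracket relations used in this proof can be checked mechanically. The sketch below rests on a representation that is an assumption of this example and not part of the text: a vector field $(a + b \sin x + c \cos x)\,\partial/\partial x$ on the line is stored as the integer triple `(a, b, c)`, and the bracket $[f\partial, g\partial] = (f g' - g f')\partial$ is written out in these coordinates. The names `bracket` and `nested` are hypothetical.

```python
def bracket(f, g):
    """Lie bracket [f d/dx, g d/dx] = (f g' - g f') d/dx for coefficients
    f = a1 + b1 sin x + c1 cos x and g = a2 + b2 sin x + c2 cos x.
    The sin*cos terms cancel and sin^2 + cos^2 = 1, so the result is
    again of the same form."""
    a1, b1, c1 = f
    a2, b2, c2 = g
    return (b2 * c1 - b1 * c2, a2 * c1 - a1 * c2, a1 * b2 - a2 * b1)

V = (0, 1, 0)    # sin(x) d/dx
W = (0, 0, -1)   # -cos(x) d/dx
E = (1, 0, 0)    # d/dx

def nested(inner, brackets):
    """[V, [V, ..., [V, inner]]] with the given number of brackets."""
    result = inner
    for _ in range(brackets):
        result = bracket(V, result)
    return result

print(bracket(V, W) == E, bracket(V, E) == W)
# An odd number of brackets around W, or an even number around E, gives E.
print([nested(W, n) == E for n in (1, 3, 5)],
      [nested(E, n) == E for n in (0, 2, 4)])
```

Since the lemma counts $k$ fields and $k-1$ brackets, the parity of the number of brackets determines which of $W$ or $E$ has to sit in the innermost slot.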

Lemma 3. For all $k \in \mathbb{N}$ there exist $(e \times e)$-matrices $A_1, \dots, A_k$ ($e \ge 2$) such that

$$\underbrace{[A_k, \dots, [A_3, [A_2, A_1]] \dots]}_{k-1 \text{ brackets}} = \left( -\delta_{(i,j)=(1,1)} + \delta_{(i,j)=(e,e)} \right)_{i,j=1,\dots,e}$$

where $[\cdot,\cdot]$ denotes the usual matrix commutator.
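Proof. Let

$$M = \left( \delta_{(i,j)=(1,e)} \right)_{i,j=1,\dots,e}, \quad N = \left( -\delta_{(i,j)=(1,1)} + \delta_{(i,j)=(e,e)} \right)_{i,j=1,\dots,e},$$

$$A = \left( \delta_{(i,j)=(e,1)} \right)_{i,j=1,\dots,e}, \quad B = \left( \tfrac{1}{2}\, \delta_{(i,j)=(1,e)} \right)_{i,j=1,\dots,e}.$$

Note that $[A, M] = N$ and $[B, N] = M$. Hence, counting $k$ matrices and $k-1$ brackets,

for $k$ even: $[A, [B, [A, \dots, [A, [B, [A, M]]] \dots]]] = N$,
for $k$ odd: $[A, [B, [A, \dots, [A, [B, N]] \dots]]] = N$.

□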
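
The commutator relations between $A$, $B$, $M$ and $N$ are easy to verify by direct computation. A minimal sketch in plain Python with exact rational arithmetic follows; the helper names (`unit`, `commutator`, `lemma3_setup`, `nested`) are hypothetical, and the loop alternates $A$ and $B$ so that the outermost bracket always uses $A$, as in the displayed formulas.

```python
from fractions import Fraction

def unit(e, i, j, c=1):
    """e x e matrix with entry c at the 1-based position (i, j), zeros elsewhere."""
    return [[c if (r, s) == (i - 1, j - 1) else 0 for s in range(e)]
            for r in range(e)]

def commutator(X, Y):
    """Matrix commutator XY - YX."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] - Y[i][k] * X[k][j] for k in range(n))
             for j in range(n)] for i in range(n)]

def lemma3_setup(e):
    """The matrices M, N, A, B from the proof, for e >= 2."""
    M = unit(e, 1, e)
    N = unit(e, 1, 1, -1)
    N[e - 1][e - 1] = 1
    A = unit(e, e, 1)
    B = unit(e, 1, e, Fraction(1, 2))
    return M, N, A, B

def nested(e, brackets):
    """Alternating nested commutator [A, [B, [A, ...]]] around M (odd number
    of brackets) or N (even number of brackets)."""
    M, N, A, B = lemma3_setup(e)
    result = M if brackets % 2 == 1 else N
    for step in range(brackets):
        # step counts from the innermost bracket; the outermost one uses A
        use_a = (brackets - 1 - step) % 2 == 0
        result = commutator(A if use_a else B, result)
    return result

M, N, A, B = lemma3_setup(3)
print(commutator(A, M) == N, commutator(B, N) == M)
print(all(nested(3, b) == N for b in range(6)))
```

Using `Fraction` for the entry $1/2$ keeps all comparisons exact, so no floating point tolerance is needed.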



Proof of Theorem 3. Existence of a unique RDE solution follows from [Lyo98]. For
(4.5) use [Lyo98, Theorem 2.4.1] with control function $\omega(s,t) = \|\mathbf{x}\|^p_{p\text{-var};[s,t]}$ to get

n>0 V 

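where $K = \max_i (|A_i|)$. However, $\sum_{n \ge 0} a^{n/p} / (n/p)! \le c_0\, e^{c_0 a}$ with $c_0 = c_0(p)$ for $a > 0$, and
therefore

$$|y|_{\infty;[0,T]} \le |y_0|\, c_1 \exp\left( c_1\, \|\mathbf{x}\|^p_{p\text{-var};[0,T]} \right).$$

Estimate (4.4) is proven in [FV08b].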

To see optimality of both (4.4) and (4.5), define a geometric $1/p$-Hölder rough
path $\mathbf{x}$ on $[0,1]$ in $G^p(\mathbb{R}^p)$ by

$$\mathbf{x}_t = \exp\left( \lambda t\, e_{1,\dots,p} \right)$$

where $e_{1,\dots,p} = [e_p, \dots, [e_3, [e_2, e_1]] \dots] \in \mathcal{V}^p(\mathbb{R}^p)$ for some $\lambda > 0$. Note that homogeneity
of the $p$-variation norm implies

(4.6) $\|\mathbf{x}\|_{p\text{-var}} = c_2\, \lambda^{1/p}.$
The corresponding RDE solution

$$dy = V(y)\,d\mathbf{x}, \quad y(0) = y_0$$

then coincides with the ODE solution

(4.7) $dz = W(z)\,dt, \quad z(0) = y_0$

where $W$ is given by theorem 2.

Case (1): Let $V = (V_i)_{i=1,\dots,p}$ be the vector fields on $\mathbb{R}^e$ from lemma 2 with
$k = p$. Then the solution of (4.7) is easy to write down,

$$y_t = (0, \dots, 0, \lambda t)^T.$$

Clearly, by (4.6), $|y|_\infty = \lambda = c_2^{-p}\, \|\mathbf{x}\|^p_{p\text{-var}}$. That the regime of (4.4) in which $\|\mathbf{x}\|_{p\text{-var}}$
dominates is attained as well is seen directly by looking at $dy = (1, \dots, 1)^T d\mathbf{x}$ for any
rough path $\mathbf{x}$.

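Case (2): Let $V = (V_i)_{i=1,\dots,p}$ be the linear vector fields on $\mathbb{R}^p$ given by the
matrices $(A_i)_{i=1,\dots,p}$ of lemma 3, i.e. $V_i(y) = A_i \cdot y$. Using lemma 3 with $k = p$
(i.e. $p-1$ brackets),

$$\begin{aligned}
W(z) &= \lambda\, [V_p, \dots, [V_3, [V_2, V_1]] \dots]\, I(z) \\
&= \lambda\, [A_p, \dots, [A_3, [A_2, A_1]] \dots] \cdot z \\
&= \lambda \left( -\delta_{(i,j)=(1,1)} + \delta_{(i,j)=(p,p)} \right)_{i,j=1,\dots,p} \cdot z.
\end{aligned}$$

Now choosing $y_0 = (0, \dots, 0, 1)^T$, the unique solution to (4.7) is

$$y_t = (0, \dots, 0, e^{\lambda t})^T.$$

Again by (4.6),

$$\sup_{t \in [0,1]} |y_t| = e^\lambda = \exp\left( c_2^{-p}\, \|\mathbf{x}\|^p_{p\text{-var}} \right)$$

and the upper bound of estimate (4.5) is attained. □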
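
The explicit solution in case (2) can be confirmed numerically. The sketch below is only an illustration under stated assumptions: it takes hypothetical values $p = 3$ and $\lambda = 2$, builds the drift matrix $\lambda(-\delta_{(1,1)} + \delta_{(p,p)})$, and approximates the flow of the linear ODE (4.7) by a truncated Taylor series of the matrix exponential. The function names `drift_matrix` and `solve_linear_ode` are not from the text.

```python
import math

def drift_matrix(p, lam):
    """lam * (-delta_{(1,1)} + delta_{(p,p)}) as a p x p matrix, for p >= 2."""
    N = [[0.0] * p for _ in range(p)]
    N[0][0] = -lam
    N[p - 1][p - 1] = lam
    return N

def solve_linear_ode(matrix, z0, t, terms=60):
    """Approximate exp(t * matrix) z0 by a truncated Taylor series."""
    n = len(z0)
    result = list(z0)
    term = list(z0)
    for k in range(1, terms):
        # term_k = (t / k) * matrix * term_{k-1}
        term = [t / k * sum(matrix[i][j] * term[j] for j in range(n))
                for i in range(n)]
        result = [r + s for r, s in zip(result, term)]
    return result

p, lam = 3, 2.0                      # hypothetical sample values
z0 = [0.0] * (p - 1) + [1.0]         # y_0 = (0, ..., 0, 1)^T
z1 = solve_linear_ode(drift_matrix(p, lam), z0, 1.0)
print(z1, math.exp(lam))
```

At time $t = 1$ only the last component is non-zero and it agrees with $e^\lambda$ up to the truncation error of the series.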